Clustering and Approximate Identification of Frequent Item Sets

نویسندگان

  • Selim Mimaroglu
  • Dan A. Simovici
چکیده

We propose an algorithm that computes an approximation of the set of frequent item sets by using the bit sequence representation of the associations between items and transactions. The algorithm is obtained by modifying a hierarchical agglomerative clustering algorithm and takes advantage of the speed that bit operations afford. The algorithm offers a very significant speed advantage over standard implementations of the Apriori technique and, under certain conditions, recovers the preponderant part of the frequent item sets.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

New algorithms for finding approximate frequent item sets

In standard frequent item set mining a transaction supports an item set only if all items in the set are present. However, in many cases this is too strict a requirement that can render it impossible to find certain relevant groups of items. By relaxing the support definition, allowing for some items of a given set to be missing from a transaction, this drawback can be amended. The resulting it...

متن کامل

A Novel Approach for finding Frequent Item Sets with Hybrid Strategies

Frequent item sets mining plays an important role in association rules mining. Over the years, a variety of algorithms for finding frequent item sets in very large transaction databases have been developed. Therefore, a number of methods have been proposed recently to discover approximate frequent item sets. This paper proposes an efficient SMine (Sorted Mine) Algorithm for finding frequent ite...

متن کامل

Mining Fuzzy Frequent Item Sets

Due to various reasons transaction data often lack information about some items. This leads to the problem that some potentially interesting frequent item sets cannot be discovered, since by exact matching the number of supporting transactions may be smaller than the user-specified minimum. In this study we try to find such frequent item sets nevertheless by inserting missing items into transac...

متن کامل

Infrequent Weighted Item Set Mining Using Frequent Pattern Growth

Frequent item set mining is one of the popular data mining techniques and it can be used in many data mining fields for finding highly correlated item sets. Infrequent item set mining finds rarely occurring item sets in the database. Most of the Existing Infrequent item set mining techniques finds infrequent weighted item sets with high computing time and are less scalable when the database siz...

متن کامل

Fuzzy Frequent Item Set Mining based on Recursive Elimination

Real life transaction data often miss some occurrences of items that are actually present. As a consequence some potentially interesting frequent item sets cannot be discovered, since with exact matching the number of supporting transactions may be smaller than the user-specified minimum. In order to allow approximate matching during the mining process, we propose an approach based on transacti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007